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' 1 Informal Mathematical ' Procedures 

The work on informal mathematical procedures is being carried out 
as outlined on page 82 of the original project proposal. We 'quote from 
that page: 

• ' : ^ ■ 

• " . . * ' ■ r ^ 

' '^During the first year, the EXCHECK proof 
checker will be redesigned not only to function as 
an interactive proof checker, but also to function 
as an interactive theorem prolT-er capable of 
accepting and executing complex proof strategies. 
...Also in the first year, the. mechanisms to 
Implicitly handle sorts will be expanded to hajcidle * ^ 
types. 'The resolution theorem prov^r will.be 
modified to make it recognize 'more proofs that are - 
obvious to the student. ..'.The new introductory 
lessons will be begun in the first year.. .The work 
on the help system will be mostly done in the first 
year... Generally, during the three years covered by 
the proposal, -the search for ways • of inf ormalizing 
^ the EXGHEGK. system will be continued." 

The. work on theorem provers is described in Section 1.3, and, the 
^> • , 

work On types. --is described " in Section 1.4.2. Section' 1.5^1 
describe^' the worfc carried out on the HELP system and the introductory 
lesson. General system ' level work, ijeeded to implement the various 
cHariges to .RXGHEGK described in this report-, are^ oxftlined in Section 
1.1. Section .1.2 and Section 1*4 discuss work done to informalize 



-the EXGHEGK system. 




i. 1 General System. Ifork > - ' " . ' ^ 

The compiler for the* VO GAL author lahguage, the theory processor, 

y ■ ' ■ .. . * ' . f . 

the curriculum driver, the proof phecker, and various auxiTiary programs 
were/restructured in order to create more unif orm^odules out of which- 
the programs could be constructed. This has resu:lted in a considerable 
*^savings in runtime space usag^ as ^well as making it easier to ^enerijte 



and test new versions of the programs'. New methods for specifying the 
parameters of Che various theories were implemented making it easier and- 
faster for curriculum authors to specify theories and also resulting in' 
a faster runtime system. . . 

.The-baslc programing -l^uage CSAILSP--.f or SAIL with a^ ^yer sion^ of 

LISP embedded in it) used 'in the EXCHECK system was modifiea to reduce 
explicit dependence on the L£AP features of SAIL thus facilitating the 
restructuring mentioned above. The; LISP garbage collector in SAILS? was 
improved and methods for more efficient transfer of s! expression^ from 
•file to core and from f9rk to fork were implemented. 

1.2 More Natural Semantics . ' . 

One of the -most difficult problems in writing CAI progratDS-- is ^ 

developing procedures that permit phe interaction to take place at ^ 

natural , semantic level/ larg4 part of earli-er vork on the EXCHECK 

systeii was directed specif.i-caily at this problem. The IMPLIES ru^e is« a" 

■ ° "'' . ' - ■ ' ■ . . . ' 

good example of this ear3.^£er work. ^ It uses extensive Computation and 

heuristics to' ensure that the user ^is not distracted by mathematically 
irrelevant logical^detail. 

' The BOOLE procedure is a" good . illustratidn ' of another method of 
placing .the interaction at a natural semantic level. BOOLE uses a 
decision procedure to determine if a given formula is « a J:heorem.. of 
quantifier free booJLean algebra. Hence,^ students in set theory and 
other -courses can use it to assert tha-t such formulas are true." 

Work has "continued during this last year 'on facilitating 
••interaction "on a semantic level. The REPLACE rule and ^the ABBREVIATION 
procedures were, completely rewritten for this xsur^ose. Other 



procedures'^ particularly IMPLIES and VERIFY, were altered to make them 
correspond more close ly^- to natural inference, . The ELFS procedures 
described below- were irapLemented as part of the.ptogram of extending the 
nonderivation exercises- Also .the VOCAL language is b^ing considerably 
extended "by the graphics and ^speech oriented semantics package described 
below. , ^ ^ . ^ ' 

" ■ ■ . ■ ■. ■ ■ ' '" 'r . ■ -■■ ' 



1*2.1 . ELFS ... '\/ r . ^ 

^ELFS' is an. acronym for Expliciirly Listed .Finite Sets. The name 

comes f roi9^the' fact that ^^^st specify the basic sets by explicitly 

listing Jthe\r^ elements^!- In the current .implementation the structure 

dealt •with is tiie class- algebra over the aet <0,/.\,35}. The language 

handled is the quantifier free language : for this structure* There are 

•procedures available to determine Pthe denotation of' any. term and the 

• ■ >\ ^ ' _ . ' - • 

truth -value of atiy f or'muia in this language (assuming of course that t^ie* 

derfotcitions of yariables has-been d,etermined) . ..." 

Iti a typical exercise the student .'is asked to specif y . ELFS that 

satisfy a given condition* For example the student might be asked to^ 

give sets' A and B such that the cartesian product A X B is "distinct from 

B X A. 'This '"is equivalent to.; Av^s not^ equal to B and both ' are' 

^nonempty. ^Giyen values for A and *B the ^^FS procedures can easily\ 



compute the truth or falsity of the l^tter^rmula. Also 'because these . 

procedures are available in the answer analysis' routines of VOCAL the « 

* A) 

^uthor can "'easily formulate semaritically based responses* to student 
responses. Continuing the- example; ah author might say in effect: if 

■ • • ■ " . - \ . ' " V. . . ^ 

one of the-se.ts A, B, given by the student is empty ask the student what 
is the cartesian product A X B if either A o^ B is empty?'. , . ^ \ ^ 



The'' ELFS represent an iraporcant extension of the EKCHECK/VOC/>lL CA.I 
system because they provide a semantic basics for the illustration and 
interactive -discussion of mathematical concepts. Previously the 
principal constructed responses allowed students were deriva-tions. This 
limrtation Iplaced a premium on skill' in constructing derivations , which 
is only one component of the mathematical maturity that the /CAI ^courses* 
strive td develop*-^ ' \^ ' 

1 . 2\ 2 • Extended Graphics and Speech Oriented Semantics 
We. are currently implementing -a graphic^ aad speech oriented 
semantics . package for the EXCHECI^/VOCAL CAT system to .give authors 
better facilities for presenting material to sstudents informally, 
precisely,, and at a. natural semantic level using a variety of modes. In 
-particular it is now easier >for authors Co illustrate mathematical 
concepts and to specify semantically appropriate responses to student 
responses. Fundamentally this is accomplished by providing facilities 

' • : • ^ '.■/.}■''■ ■ ' , • / ■ . ■■ 

for ^the author tcf specify fipw -forj?iulas and t6rns of-, languages such as 

the' languages for set theory and 4ogic can be Vinterpreted** -visually , and 

aurally. . ' . • 

For example, using the semantics package, a graphic representatiorj 
of a fact about sets can bet displayed f or ' the student. At the same 
tine, audio comments can be"^ made using the computer synthesized- speech 
system. This combination is especially useful for introducing new 
concepts and for testing /the.^ student "s understanding of pX^viously 

introduced conceptp. . " ^ " ^ 

\The semantic^^ackage is designed to be versati3i?fe arid extensible* 
Not only can the^' semantics of languages l^e specified but also the basic 



display and/Sfudio commands available for constructing semantics include 
most of the ^/OCAL commands already available, to the curriculum author. 
Among these basic commands are those which display, erase, brij^hten and 
unbrighten areas on the ^ideo-display- screen, and those which generate 
audio messages. Other VOCAL commands /Mailable for constructing 
semantics include some which require -student interaction. for 
example, it is now possible to display the visual semantic values of the 
subt^erras of a complex term 'separately, each followed by a pause 
requiring a student response before continuing. As a further example, 
using the semantics package authors can easily create exercises -that 
require the studeat to input a term in an appropriate language. The 
student's response tften can be displayed and compared^ graphically with 
correct solutions. ^ 

i 

i.3 Theorem Provers . " ' • 

- — 7~ • ■ - \ V 

Implementation of the new interactive naturaiy^duction th^rem 
proper has begun and the first version should" be f^inished during the 
first quarter of next ye^. Most of work so far has gone into designing 
program control and data structures adequate for an interactive natural 
deduction theorem prover and .yet efficient enough for real time use* 

' ' As an example, a significant operation in the new prover is the 
natural deduction ^.analog of unification: comparing two formulas to 
determine •'which (possibly empty) set of ^ instantiations would make the 
formulas equivalent. The procedure that does this is called INSTANCE* 

iln resolution provers the formulas passed to ^unification are in a rath^ 
simple canonical form. In the new prover ^ IN^!^NCE is applied to 
formulas in -standard form. . Hence there is a question of time 



efficiency.. ' We have been able to" design data structui^'es that permit 
INSTANCE to run efficiently on formulas in standard form. In .particular, 
INSTANCE without identity runs in a time proportion^ to the lisp -EQUAL 
function without Requiring that biconditionals, uniqueness quantifiers, 
^'o^^^er complex structures be converted to a more primitive forni. The 
complexities and increased running time introduced" Sy adding- identity-- to- 
INSTANCE seem no wbrse tharT^cr analogous procedures such as resolution, 
and in fact might be significantly more efficient although . proofs^ 
establishing this have yet to be done. The general data structures ffte * 
also designed to be efficient for identity. In both ca^es the design 
grew out of our work on the TEQ algorithm, work done with resolution 
th^rem provers, and consideration of various back-tracking and parallel 
processing schemes J ' \ • 

Substantial Vf fort has also been -put into designing top level data 
structures and p/ogram control capable of efficiently interacting with 
users and accepting complex proof strategies from them. A major problem 
here is structuring program control, so that interaction can occur for^ 
global or ^.strategic jjecisions about how to^^^!ki^,,the/propf while' leaving 
the pr/cedures that handle routine work .^to functiot^' without 
interr-tiption. The current version employs a multilayered approach to 
handle this problem*. The user interacts with the STRATEGIST which in 
turn controls the routine work via . a subordinate procedure: 'the 
TACTICIAN. A^iother beneficial aspect, of^ this .design is that, very 

roughly, decisions made by the - TACTICIAN are local and have lir^ar 

7 ^ ■ . ■ 

effect while decisions made fey the STRATEGIST 'are global and can have 

more than linear effect. The multilayered design also provides a 

natural basis for the acceptance .and execution of conplex strategies: 



they are passed directly to the STRATEGIST which in turn uses them to 
direct the TACTICIAN in its work. 

/ 

Most of this year's work on theorem provers has been devoted to the 
strategic theorem proveor, although the current resolution prover has 
been maintained and data collected for* use in the deyelcJpment of. any new 

-provers* ^ear - the - beginaiag of _ . this. year , : work _ on. .a substantial. 

revision of thJe resolution prover was "done in parallel with ^ :-ly work 

X ; ' • ' 

r ' ■ ^ 

on the strategic ^?fover. Most -of the results of that work will be 

incorporated in routines ^in the strategic prover. 

1.4 More Informa l Mathematical Procedures 

. Informal Languages' and IProof Summarization \ \ 

The output grammar is being^xtended to inclade'a more informal 

^mdSe. For example^ the extended procedures willV output maps A into' 
B'' instead of ^f: A B/ and "every element of B is^n element of C 
instead of ''(A x>(if x ia B then x in C) 

•Procedures are. being developed that will permit, effective and 

na^iirj^l summarizations of derivations. The procedures first organize 

the 'parts of the proof on the basis of logic .1 dependencies and then 

■ ■ ^ . - ' • ' 

s^knari^e the resulting patts ^or the 'i^^ts of the major steps. When tTie 

rrSw -informal mode of *the output grammar is conpleted it will be doupled 

with the. proof summarization ^rcfcedures to produce informal summaries^ 

proofs.^ . ^ 

1.4.2 Sorts and Types ^. " ' 

.Work has begun on the extension of the sort machinery to handle the 
•'types'* of functions; i.e-*> to extend the implicit sorting'^machiriery to 



compute "not only the sort of object but also to compute descrlptlbns for 
function that code how to compute the sorts of their values j^iven the 
^sorts of their • arguments. This extenston is.^quite difficult to 
accomplish efficiently , as the iatrodyction of' a new type for a function 
could require updating ^the sorts ^nd types "of a Rr-eat many objects. 



1.4.3 Simplification and Computation' 
Work on providing"^ students with more access to the simplification 
procedures available through the REDUCE fork of the EXCHECIC system is 

• n ' . • 

Still in the experimental ' stage. Currently, students can use the 

' * "t * » » 

ALGEBRA command .to manipulate identities. - - 

• * 

^ -Example of the Use of the Current 
Implementation of the ALGEBEIA Rule: * 

, (8) P(A) + P(B) = P(A U B) , > ' ^ . J 

*8al£$EBRA^' \^ ' <: 

Schematic form: ' ' ^ ^ 

r+t - xl . * : ■ 

New form *t ^ xl - V$ 

' - ft . 

Multiplicative factor used ..to get .new "form *^ • ' > 

' '8 ALGEBRA 

(9) P(B) = P(AUB) - P(A) 

■ note: underlining Vindicates student inpirt^ $ denotes "ENTER' key. 

It would be of great Value in the course in the foundations of 
probability to extend the algebra - rule .to handle^- cases involving 
inequalities. . Thus% ^ it ^ould facilitate inferences > ?uch as the^ 
following: ^ . 

- 1, q-t c r-t implies q <. x . 

' 2. q < 0 & r < 0 implies q/r £.0 
, , '■ 3. q < '^Z & q > 0 implies r' > 0 



8 

"12 ■ 



Coin'plex' calculations involvrn§j>- differentiation'*^ and iiitegration yoal^ ^ 
also be of -great use in 'the course/'^and^ could "be itnp'lemented in th^ 
REDUCE fork (see Hearn>for a description . of * the REDUCE- lang'E^ge) . The * 
Student should also be able to do /very simple calculations in 
preparation for -rel^onding to. short answer^* questions^ - . than 
^erferate a call to the REDUCE 'fork ve will :in^rporate a. calcuf^ 



'the upper fork'^of^ the probability course- ^ Qnei^^ml^iLfe. of the "kind of 
Calculkt ions, involved .is the following* ^ ^ " V 

* ' . ' ■ . ' . ; ■ ■.' " " . 

• ' . Students . are . asked .to compare (first, intiritively) 'the pro^bility 

bjE getting^at least on^ 6 in four throws of^'a'die', to the probability of 
; • - . • ' „ . 

throwing- at least one double-siic - iti twenty . f out . throws of ^ P^^^ 

dice; People will often assume, that the .probabilities will *-be the same^ 

especially if is pointed ^out thatr^. 4/6 equals '24/36,» Afterwards the 

student ^ can ' calculate for himself precisely what ' , values ' the. 

probabilities in question have* - They are: > 

P(6 in four throws) T- P(no 6 in four throws) . . 

^ * 4 

^ =1-- P(no 6 in one tt^row)"^. 

^ = 1 - (5/6)^ 

. • = 0.518 

' P(6,6'ia*24 throws) =1 - P(no 6,6 in 24 throws) 

• ' V.24 

'= 1 - P(no 6^6 int one throw/ - 
, . V . • = 1 - F(35/36)^^ 

. V = 0.491 ' 



•1,4^4 Interplay Between Theoretic and Metatheoretic Methods 
. iai the proof theory course results are established in a raetatheory 
(TEM) for Zermelo-Frahkei (ZF) set theory/ In particular, students- ^ave 



to ' establish an the metatheory that, under; given conditions certain 
results are^-ptpvable''- in '-the object theory. The standard mathematical 
metriod for 'doing* this ."as to- simply derive, the result directly in.Jfhe. 
object theory, aacf theo, use- tt^.s fact in the metatheoryi 

J*rt)cedures have 1>een aidded , to' allow students to taake such 

' • . . ■ ' . • ■ ' ' • • 

inferences the proof theory Murse. ^T<ro inference, procedures arg 

■• ' • " . . * ^ ■ r ^ * . . , 

-^involved: ZFSTART for starting, a- derivatic^n in ZF 4rom the metatheory ; 

and "^FFINlSk for finishing "the ''der'ivation in ZF and . returning to. th*fe 

t . • ' . .«■'•, 

me.tathepry. After^ starting, a ZF derivation from the metatheory priori 

results from the metatheory or the metatheoretic part of the derivation^ 

may be referenced from the ZF' part of' the .derivation. In the example 
' • ^ ^ • ■ 

below, two lines in the metatheoretic part / of the der^ivation are 

• " * ». '* 

referenced from the ZF part o£ the derivation. There ^is a restriction on 

' ' r . ■ , ^ ■ -if 

the form of metatheoretic results that may be\referenced from inside ZF: 
they must be^atomic formulas pf the form ZF I- F or conju]^tions of such 



formulas. I ■ 

■ ■ /■ \ . -• 

Derive: ^ ■ 

IF ZF'I- x=y AND ZF |- y=z THEN ZF ]- x=z 



WP (1) *ZF I- x=y 

WP '. (2) *ZF I- y=2 
■ *zfs$T ART 

***A*****it******* ZF **************** 

. >1.2teq$ (3) *x=z$ ^ ^ 
Will you wish to specify? (No) 
.Using *g$0 V ■ 

: #3zff$INISH . ' 

' ■ ■ ■ • ■ ■ • ... . 

3,1,2 ZFFINISH V ^ > ■ , 

(4) 2F I- x=z- . % , , * v.y 



1>5 User ' Aids * - , , ' 

1.-5 Ar HELP System -and Introductory Lesnson 
. * The introductory l^feson tor, t.he^XCHECK coursed was . completed, this' 
summer.- .Student documentation was updated and"- iiiiRr6ved,^i'to reflect; the 
changes in the EXCHECK system. . Previously, the HELP system v was loaded 
^n^o the program' only upon being called by tlxe stu«$6nt. This resulted 
in long waits foi< the HELP files to "b^ read in. The system was revised 
to" permit loading with. - the EXCHECK program, thus reducing ^ the wait 
during a students first call to . HELP. The strongest barrier to 
extending the HELP system to its, full potential ^currently ^r^e need $o 
produce remedial, enrichment, and tutorial curriculum , modules to be 
accessed by the system^ - ^ • 



1.5,2 Shorthand for entering Terns and Formulas. 

A new notation system has been added for referring to parts of 
already accepted terms and formulas. This system provides students a 
shorthand method for entering terms and formulas. 

F9r example, assume 'that a student is doing a derivation which 

contains the following step: . ... 

* • ■ ' • 

(4) A is. a subset of <E: E is in powCCT))} . ' 

■ iff . , ' ' ' ' 

» A is a .subset of f (G^k^ ; 

"•>' ./ • ■ ■ " ' • *^ 

The student could enter the formula ' ^ ^ , 

" not ( A As a subs'et of <E: E is in pow(C^D)> , ^ ^ ^ 

> and ./ ■ \ ' ■ 

A is a subset df-f(C) •) / 

W , % * *^ . . ' • \\. 

by typing: riot ( fm:4:l & fm:4:2 ). The expression "fm:_4:l' deKignates^ 
the .formula that is , the f.irst ' subexpres^sion of the formula onj-ine 4. 



^Simiiariy, "tia:12:2:l'- would' designate the .term that - is, the. first 
s.ubexpression the of the second subexpression o'f the formula on line 

.vX2, • Designators begining with, "f m' ' We parsed by the grammar - as' 
formulas ajid heace can be used in any expression at any point at which a 

• formula ■ ^ . appr'opriatei Similar-ly, designators begining with "to' can 
Wfw^s be used instead of the term designated. . .9/ 



Audio, and Prosodic Features 



The work on AUdio and Prosodic Fea^tbres ;is proceeding^^largely as 
•outlined 6a page -83 of the .origi^U project proposal.. We quote from 
that -page: " , • , f ■ , . 



^ •'In the f irst year^ of this proppsal, we would 
design ; knd build the' 'MINI-MISS maphine, impiement 
some 6$ ' the data-compression techniques, atrd do some 
preliminary studies on , the quality of the audio we 
are producing. We would, continue improvements to 
prosodic contours. -and- the syntactic .parser and add 
prosodic contours. ' la ^ particular, the syntactic 
''parser and contour-generation- producers would be 
reimplemented for efficient on-line generation of 
synthesized speech^ \ ' 

^- ' In the secotlcj year, we would .... simulate the 
coai;atenation of affix,es to root words." 



mpre s's i^»— t e,c 



The work done on data-comi>res'si^»-te,chniques in^onnection with the 
design of " the MINI-MISS^ machine is described in Section 2.1. The 
preliminary studies: of prosodic quality are discussed .in Section 2.5. 
We' also discuss improvements to the syntactic parser (see Section 2.2) 
and prosodic contours ,(see Section 2.3). 

As stated abpve, we expected to delay work ^n an. algof'ithm for 
concatenation of affixes to root words until the second ' year of the 
grant an'd to;' emphasize on-line prosodic generation in the first year. 

\ , . . • S . . . . 

As explained iti ^Che "interim report, it was ne^ssary to exchange the 
,two. Our work this -year on Qoncatenation is described in .Section 2.4, 
and our -plans for on-line generation of prosody are -discussed in Section 

2/2.2." . ' • • „ . ' . ' 
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2*i . Data Cbmpression Techniques ^ 

The present mISS synthesizer is not ^suitable for duplicartion fpr 

iise at' sites Veafote^from Stanford. The use of LSI integrated circu^-ts 

■ • ^ , ■ • . . . ,^ . , 

and the femovai of the generalised featrifre^- 'that, are' -.necessary ^ for 

' '"'^ V , * . * ' * . r ^ , 

speech research will -allow us to build . a m^lI-MISS machine which would 
' • • '« # .* . • 

fulfill, the. requirements; of a small cluster of remote . terminals for 

* ' ■ * ' ! 

using audio. The speec^ parameters woul3; be transmitted to this MINI-, 
MISS machine via the same data-line connection that the tenriinals would, 

.. .. , ^ \ 

use, although that data line would pequire a higher bandwidth to handle 
the additional speechrparameter load.-^. The . same high speech quality and 
prosodic manipulations available in the full MISS system would be 
provided to such remote users with the MINI-MISS machine'. ' 

Integral to the design of . the MINI-MISS machine, we are attempting 
to utilize the results of refcent research in speech synthesis. In 
particular, new techniques for improving the .quality of the synthesized 
speech and^ecreasing the storage size- required to represent the speech 
•have beea extensively studied. We have been duplicating these 
experimental findings invthe context of the MISS system^. The importance 
of conducting these studies before, committing ourselves to a design for 

the MINI-MISS machine can be demonstrated with respect to two specific 

^ • ■ ■ ■ ' - . . 

issues, log^ area 'ratios and covariance analys^is. The use of delta- 
encoded log ^rea ratio reflection coefficients .holds great promise in 
substantially ^reducing * our storage requirements- arid the transmission 
rate for the^MINI-MISS machine. The digital filters required for' audio 
synthesis using th^s€N^fficien^ is,)however, incompatible with thos^e* 
designed for. tlr^ LPC. parameters we/are currently using. Thus it is 
critical' to' ^aluate the potenti^al usefulness of these coefficients 



before designing and' ^specifying one of the central parts of tKe new 

' ■ " " •. ' . ■ :\ ^ ■• . V. • 

machine. " * • ^ , ^ , ' 

V 'Similarly, the use of^ covarianc-e linear ^predictive ^analysis instead* 

- — ■ " ^ ^ 0' . . ^ * ' ^ ' ' 

of^auto-corr-^latxon linear' predictive analysis would, i-mprove the quality 

> * . ^ ■ ' * V • 

of the synthesized st)eech. The cov^riance 'techniques sotnetimes ^.generate . 

^ ' . j» ■ .■ ' ■ . . ^ - ■ '. ^ . 

iastabij-i'ties in_, the- 'speech parameters, which, unless /overcoaeV 'would 
make this technique unsuitable for our oper.ational syjathersis . system.^ We 

•have found that ^he 'JLiterature on covariance analysi^\ understates the 

■i ■ , ^-^ / 

importance of these instabilitiei'^ and we are experiment|.ng to discover 

* • ""^-y 

if they can be minimized sufficiently. Again, in this case, the design 

of the new machine is dependent on the results of our experimentation 

■ ■■ ■ , ' ■ . ■ ^ • . 

and duplication and evaluation of current research results. 

. *^ 

. * - ' 

2.2 Improvements to the Syntactic Parser - 

<f . . » . ✓ 

We have increased the complexity of t^e synt;actic' sub-componenp of 
• " ■ ^ ' ^ * ■. - 

the prosodic system in order to expand the range, of constructions which 

we can accommodate. Most of these extensions are not notable as 

discoveries of new ways of. parsing but. are 'rather the outgrowth of 

selected review of the structures that were generated by the previous 

'parser. ' One • extension that is interesting is the recognition of 

different grammatical f^inctions. for words in some word classes. In 

addition, streamlining the syntactic sub-component will be useful in the 

■■ . ---^ /' ■ . 

on-line generation of ^1 prosodic contours. 

.■• — 

' 2.2.1 Multiple ' word functions and^omographg 
There are many cases of homographs, Lords spelled the same, but 
pronounced differently and having different grammatical functions, • in 



Erigli-sh. Consider the case of. two teases of the word "rea*W. >Iq the 
present 'tense-, the- word is homophopous with ''reedl'V' while i^^^e pas^ 
tens^- it is homophonojis with '"red." A laiige ilass >a%^dj€citives- and 
ve'ctfe with videhtical' spellings differ in' pronunciation depending 'on. 
whicti grammatical function they fulfiaiX* - Many.no^uns. and, verbs a.lsp have 
this' property*-^ Some . examples ' of ,adjie,ctives ^anS^ yerbs- with -i^edtic^l 

spellings. are: ^ . , ^ , 4*. ' - 

1 ' • ' ' 
present (adj: current, vb: to give) , . ^ - 

alternate (adj: other; vb: to switch between 2 choices) 

/ , compact (adj: mathematical 'Concept applies, vb: to compress) 

^ /-^ ■■ . . ^ - y 

Examples of nouns and verb's are: 

affix (noun: prefix or suffix, vb: to add^ something to, a base) 
. ; content (nouti: material contained, adj/vb: happy/to make hapfjy) 

\ ^ " V,, . I 

We have improved our syntactic analysis so that ijL chooses correctl^^in 
\ - • . .. - ; ' 

most cases the, correct grammatical function and conseqaiently the correct 

sound for these homographs. In most cases, ^we have also had to create 

one of the two forms of the spoken word. This aspect is discussed more 

fully>,below. Section 2a4, 

2.2.2. . On-line generation * ^ ' 

While we had initially expected that a greater importance would be 
placed on the development of an on-line generation algorithm than on 
word formation, the actual course of research has exchanged the two* 
Our preliminary plans for on-line proof summaries in audio are to 
precompile, in parameterized form, the messages to be generated. For 
example, consider a message type such as "alpha proves that beta holds 
when c^amma is also true'% where alpha s beW and gamma are ^place holders 
for {potentially comp^lex phraaes. A parameterized form of synthesizer 



. 1 



'This «word can also be'a noun, meaning , 'a gift'. 



conunands tiiat will generate corcSect'ly the static parts of ttfe message 

^ , ■*■ " ■ . ■ ^ ' . '. ■ . .V • . - - 

and -give a- good approximation for speaking the specific material -filling 

X in 'the'. pT-a^e holders will be stored with^sthe routine f or " sel^ting the 
»•'•.* . . . " ' . ■ ■ ' . 

. messager^ During the next year of the grant', we will implement this 

. .rgtiCine and eyal^iate if,, trying' to est^imate the cost -in processing -time 

-•/resul;^ng in. various grades of_^.audio quaiiity. "^^"^ .* 

4- .. .. - ^ ^ ■ 

' . ■•• V ■ : ■ / 

. l.Z^ Prosodic Coritburs 

' TKfe rn^ior development in our \ pTtosodic contouring was the 

» , ■ '■ ■ I • 

implementation of ratio intervals' on 'the pattern of musical notes for 
■ -the pe^ pities which we assign to >^ch word in an utterance. Fuller 
discussion of this idea is presented in .Levine (1977)*, but the, simple 
idea is that the^ targets, for the pitches in an utterance take the 
relationship of musical intervals to each other, ^It has beenvtioticed 
> (Lib^rtnaa, 1975> that in children 's chants, the intervals between wor^s 
are approximately minor thirds.^ We have carried tfiis idea further and 
are using ratios ranging from a quarter tone' to a perfect fifth and 
octave in generating our prosodic parameters. Using musical intervals 
does not mean thaC dhe words are sung. We expect that the musical 
intervals sound -more natural than the previous assignments that we were 

K • • • 

using. , 
^ Ware curreaCly trying^ to' experimentally determine whether rfiere 

is *a^ significant improvement in student acceptance -of this form of 
prosodic gene-ration over the previous (non-musical) system. The actual 
• difference in values of parameters is quite small, and, while the new 
version did seem to us to be qualitatively^'better > we have not^ yet been 
able' to adequately prove our point. The, proof^ is especially elusive 



since the complex' factory affecting listener acceptance of ^ ttiis speech 
prevent us from isolating 'the response t6 the praVticular phenomenon we 

' - .^W/ . ■ ■• ' ^ )■ ' ■ ' ■ ■ ' 

are invest igating/X ^ ^' * ' ' ^ 



We also' presented some results, comparing our audio, sys't em with 
othei/ potential and actual systems,-at the . Acoustic- Society of America,, 
94 tH .meeting.' ^ One <^aper' Showed that . our word-basetj prosodic model 
corresponds, well to* theoretical predictions developed by Klatt . (1975) 
-A sfecond paper compared the -rate at which school aged children are able' 
to adapt to synthesized speech. 

r 2.4' W6rd formation by concatenation . ■ ^ ■ . • 



- Ivf the first year of thte '. grant ,we have begun to . develop an 
algorithm that JilL^utomatically concatenate roots and affixes in* order 
to produce j/t3lrds waiich were not recorded for our lexicon. There are'two 



situations in. CAI where such ""new' words are required. One 'is for 

meeting the needs of curriculum au theirs who may modify existing lessons 

or design hew ones which may introduce new terms or use forms of words 

which were not in the old curriculum. We mentionaed above the need^ for 

having both spoken versions of homographs available t.o. the syntactic 

parsing system. - Since the lexicon initially contained only one spoken 

version of each word we have had to* fo^m tt!e other ourselves. The other 

need comes from interactive prosodic generation. In its most general 

form,' the interactive component will use words which may not have been 

( . . ^ ■ . ■ 

in the curriculum, in order to respond utilizing the student 's own terms 
■ ^ ' ' . ' " ' '. . 

and phrases. Our approach to the production of ne^ words has been to 

develop progressively automated techniques for concatenation. The more 

rudimentary techniques, . aid us in expansion of the lexicon for use by 



curriculuia authors in .modifying less^Sns or producing new, ones. 

^ ^ * . .J ■■ ■ ■•■ * 

current procedures' are suitabife^ for inolusion into the • next, more 

complex, stage of concatenation. 

^ ■'. 

2J^.\ . Res'e arch oa Concatenation . 

;•■ " - . .. ■ . ■■ . ' 

"v ' 4 0*i£ f irst step in this research^Jtf^as to duplicate the initial 
research done by Lovins and Fujimura. This work^uses a special set of 
"demisyllables", each sounding- like half a syllable, which can then be 
' concatenated with' each Other to form a regular syllable or word. For 
example'^ the ''bt-" taken from the '"bi^" might be combined with the "-it" 
taken from the word *'sit" and concatenated to. form the word "bit." We 
have based pur general procedure on this concept with the exception of 
certain affixes (such as the plural and ."-ed" endings) which are better, 
treated as combining with a full root word whose last segment has been 
slightly shortened. ^ - ' 

One ^oblem tfiat /became- apparent when we imp lemeated this approach 
is thst since there is a ^neeJ for a linear smoothing of the LPC 



c 



oeffic^ents at the concatenation boundary, we have to be careful about 
the resulting segment amplit;ud^. The coefficient smoothing creates a 
"new" segment at the boundary as part of the smoothing process, but we 
do not yet -have .a good predictor of loudness for that created segment. 



Thus we are subject to the introduction of clicks and .pops at*, the 
boundary. ' We are experimenting with an alternative approach to 
amplitude for incorporation into the KINI-MISS machine, which should 
simplify this problem. ; . : 

Another aspect of concatenation that needs to be considered is 
assuring that the reciting word will have the appropriate duration. We 



are currently- examiaing the algo'rithm from Kl^tt . (:i975), in 'tliis. 
respect. 'We are/ also developing automatic procedures, for "identifying 
syllable- boundaries. These boundaries' are critical for the development 
of the set- of demisylla-blgs, and usable root, words. Currently, we use^a 
measure- of spectral distance, or how constant the scJund parameters are,- 
to determine where the vowels and syllables are in each ward, but this 
attempt - will certainly need to be complicated • to include more 
sophisticated linguistic information •^/^ 

2.4*2 Additions to the Lexicon 

While developing the automated procedures for. word formation, w6 

J" ■ ' ' ' ■* 

are already using the sub-procedures at hand to form the words which 

.-. . ■ \ . . . ■ ■ •■■ 

various courses' required- In particular, in the first year, almost 200 

words were formed by hand, using these sub-proc.edures. In general the 

method has been to start with a sequence of words in the vocabulary 

'which will -each contribute ^^part of the desired word. An aut^omatic 

sub-procedure is used to create the appropriate demi-sy liable boundaries 

for each word in the sequence. The boundaries then may be adjusted by 

hand to improve the concatenation. Finally the actual concatenation, 

with pitch, duration and loudness adjustments (but without segmental 

staoothiBg at the boundaries) are made. The newly formed word is stored 

in 'the general vocabulary, but is marked as specially formed. Once in 

the vocabulary, the word is used just like any other LPC analyzed word, 

for sentence generation. ^ , ' 

2*5 Study of Prosodic Quality ^ 
During the course of this year we have been gathering data'.pn 



comparisoq of student preference for audio with stored digitized phrases 
as against audio with . automatically, generated prosodic xontours^^. The 
experiments involve both direct, comparison* ^nd indirect compaifison via 
a third nonaudio mode. Preliminary results seem- to indicate t'hat 
studeti€ preference for audio as against noti-audio mode is no less when 
when Che audio mode is the synthetic prosody, Hhan when the audio is 
canned phrases . .. . 

Further data is being '-^gathered during the current quarter of the 
introductory logic course. More detailed analysis will be carried out 
on this years data during the finari quarter of this year of the grant 
and inrthe first quarter^ of the next year. 
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■ y • • ■ ■ . • 

' 3 ' Schedule o ^ R . Proposed Research «^ . . ' 

3*1 laforml Mathematical Proce'dures. . 

, ~~ ■*■ ^ -""^ 

• • The schedule for completing research ori^irmlly proposed for the' 
•secoad year of the grant is essentially the same as that given, on page • 
82 of the original technical proposal. ^ ^ ' 

The primary goal for the second year is to finish implementation of 
the first version of the interactive natural deduction theorem prover . 
and to refine ' and ^ extend it on the basis of data from use in the 
courses* The . basic language for stating the proof strategies to be 
accepted by the prover wili be implemented^ and informal languages for 
proof strategies- will- Ke^ investigated (see pages" 41-50 of the ^original 
proposal for 'details). The basic interactive mechanisms for des cribbing 
partial proofs to students and receiving further instructions will be 
- implemented (see page 50 of the original proposal) . Refinements of these 
methods will be studied and evaluated on the basis of data collected. 
Procedures' for giving dynamic guidance to the student based on how the 
natural deduction pr:over would continue the students current work will^ 
b^^investigated and if possible implemented at the end of the second ' 
year. . - . 

J" « 

The computation procedures available to curriculum ^jithors will be 
' - extended in order to broaden and deepen the- range of possible exercises* 
«./ Particular .attention will be given to assuring that student interaction 
with the /ystem during 'these exercises is at. a natural semantic level. 
In particular,, /mechanisnis/ will be added to evaluate expressiotVS' in 

particular structures or theories (where feasible of course). For 

» . ■■ . » ■ ■ ' * . ■ ■ 

* . * « 

'example, truth or validity in class , algebras over finite domains, small 
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finite structures with' a binary relation, etci For further details see 

pages 39-40,55 of the original technical proposal* 

'The' general work on. inf prmalization of .the inference machine^ry and 

-placing -the--- user /sys tern. JLaterac seniantic le^^^ 

^ . . . " . • \ ' >, > ■ 

continue. In particular, work on irjforinal output languages and informal^ 

' •. • ■ . ^ 

proof sunnnarizat ions will be continued, as described on pages 39-40,51- 

54: of the original proposal. , 

Also general system work aimed at improving space or time 
efficiency will be continued as will such work aimed 'at .facilitating use 
of the system by curriculum authors (see pages 39-40,53,56-57 of the 
original proposal). 




3.2 Audio and Prosodic Features- ' ' . 

■ ■ ^' ., - . ■ * ' . ' ' ' ' 

The schedule for completing research originally proposed for the 

second year of the grant is- substantially the same as that given . on- page^ 
83 of the original technical proposal, with one exception. Work on ^ 
concatenation of affixes and roots,- originally proposed for the .second 
year, was moved to the first year in . order to meet theV need for more 
vocabulary in the dictionary of stored .sounds. The reasons for thi^ 
.change are discussed in the semi-annual Interim report. Because of this . 
change we have rescheduled the efforts- to implement efficient on-line 
generation of synthesized speech, as described in pages 63-65 of the 
original proposal. " 

In the next year of this grant ^ will also test the remote MINI- 
MISS machine in the\field, and continue to implement new LPC techniques 
(see pages 58-59 of the original proposal) .V,We will continue to study J 
the relation :of the quality of the synthesized speech as described on 




pages 65-66 of the original proposal. The ''semantic" analysis and story-* 

view analysis (see pages- 61-62 of the original proposal) > will be added 

' .. ' \ - 

to the total p-rosodic analysis. Work will continue on concatenation and 

-J- ' - ' ' ' . . ' 

i^roving the algorithm for digitizing individual words (see page 63 of 

the original proposal) . ^ . ^ 

. .• , ■ . ■ \^ : • ■ 
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